Fuzzy Knnmodel Applied to Predictive Toxicology Data Mining
نویسندگان
چکیده
A robust method, fuzzy kNNModel, for toxicity prediction of chemical compounds is proposed. The method is based on a supervised clustering method, called kNNModel, which employs fuzzy partitioning instead of crisp partitioning to group clusters. The merits of fuzzy kNNModel are two-fold: (1) it overcomes the problems of choosing the parameter ε allowed error rate in a cluster, and the parameter N minimal number of instances covered by a cluster, for each data set; (2) it better captures the characteristics of boundary data by assigning them with different degrees of membership between 0 and 1 to different clusters. The experimental results of fuzzy kNNModel conducted on thirteen public data sets from UCI machine learning repository and seven toxicity data sets from real-world applications, are compared with the results of fuzzy c-means clustering, k -means clustering, kNN, fuzzy kNN, and kNNModel in terms of classification performance. This application shows that fuzzy kNNModel is a promising method for the toxicity prediction of chemical compounds.
منابع مشابه
Data Mining Technology in Predicting the Cultivated Land Demand
Although data mining is relative young technique, it has been used in a wide range of problem domains over the past few decades. In this paper, the authors present a new model to forecast the cultivated land demand adopts the technique of data mining. The new model which is called fuzzy Markov Chain model with weights ameliorate the traditional Time Homogeneous Finite Markov chain model to pred...
متن کاملThe Prediction of AIDS Survival: A Data Mining Approach
Data mining methods which combine techniques from database research, statistics and artificial intelligence have been applied to many research disciplines including engineering, finance and medicine.The objective of this paper is to describe the feasibility of applying a predictive data mining technique to predict the survival of AIDS. An adaptive fuzzy regression technique, FuReA, was used to ...
متن کاملA Neuro-Fuzzy Approach for Data Mining and
Data mining is a technique for discovering hidden knowledge of strategic importance from large databases. This technique is currently being successfully used by a number of real-world applications. Data mining is a tool for increasing the productivity of people trying to build predictive models. Both disciplines have been working on problems of pattern recognition and classification. Rule induc...
متن کاملUsing Combined Descriptive and Predictive Methods of Data Mining for Coronary Artery Disease Prediction: a Case Study Approach
Heart disease is one of the major causes of morbidity in the world. Currently, large proportions of healthcare data are not processed properly, thus, failing to be effectively used for decision making purposes. The risk of heart disease may be predicted via investigation of heart disease risk factors coupled with data mining knowledge. This paper presents a model developed using combined descri...
متن کاملHorizontal Generalization Properties of Fuzzy Rule-Based Trading Models
We investigate the generalization properties of a data-mining approach to single-position day trading which uses an evolutionary algorithm to construct fuzzy predictive models of financial instruments. The models, expressed as fuzzy rule bases, take a number of popular technical indicators on day t as inputs and produce a trading signal for day t+ 1 based on a dataset of past observations of wh...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- International Journal of Computational Intelligence and Applications
دوره 5 شماره
صفحات -
تاریخ انتشار 2005